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ABSTRACT 

An energy efficient distributed Change Detection scheme based on 
Page's CUSUM algorithm was presented in (2J- I n mls paper we 
consider a nonparametric version of this algorithm. In the algo- 
rithm in 1 2 1, each sensor runs CUSUM and transmits only when 
the CUSUM is above some threshold. The transmissions from the 
sensors are fused at the physical layer. The channel is modeled as 
a Multiple Access Channel (MAC) corrupted with noise. The fu- 
sion center performs another CUSUM to detect the change. In this 
paper, we generalize the algorithm to also include nonparametric 
CUSUM and provide a unified analysis. 

Categories and Subject Descriptors 

G.3 [Probability and Statistics]: distributed functions, renewal 
theory, queueing theory, stochastic processes; H.3.4 [Systems and 
Software]: distributed systems 

General Terms 

Algorithms, Design, Performance, Theory 

Keywords 

Nonparametric CUSUM, Decentralized Change Detection, Reflected 
Random Walk. 

1. INTRODUCTION 

In the problem of distributed change detection, there are multi- 
ple geographically distributed sensors, each sensing a sequence of 
observations. The distribution of the observations of all the sen- 
sors changes simultaneously at some random point of time, and the 
observations are independent and identically distributed (iid) con- 
ditioned on the time of change. The sensors send processed ver- 
sion (e.g., scaled or quantized) of their observations to a decision 
maker (fusion center) and the fusion center fuses the information 
from various sensors to detect the change of law as soon as possi- 
ble. The performance metric is a measure of the number of extra 
observations taken after the distribution has changed under some 
false alarm constraint (the constraint corresponds to the cost asso- 
ciated with declaring a change when it has not taken place). This 
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model finds application in biomedical signal processing, intrusion 
detection in computer and sensor networks ( 1191 , 1181 ), finance, 
quality control engineering, and recently, distributed detection of 
the primary in cognitive radio networks (Q, [ 15 1). 

In the Bayesian formulation of the change detection problem, 
where the distribution of the change variable is known, the objec- 
tive is to minimize the mean delay of detection subject to probabil- 
ity of false alarm. The optimal solution is obtained in (161 when 
the sensors transmit the raw observations (and hence a centralized 
solution is feasible), and in 1191 when the sensors transmit a quan- 
tized version of the observations. 

In the Min-Max formulation, when no knowledge is assumed of 
the distribution of change, the worst case delay is minimized sub- 
ject to a constraint on the mean time to false alarm. When the 
sensors send the observations in raw, the CUSUM algorithm (first 
proposed by Page in 1121 ) was shown to be optimal in (8) and 
1111 . When the sensors process the information before transmit- 
ting, CUSUM based schemes were shown to be optimal in 1 101 . 

The above algorithms minimize delay subject to constraints on 
only false alarm. Hence, it is not clear if they are also energy effi- 
cient. As a result, the distributed algorithm which optimizes delay 
under any of the above two formulations, under both false alarm 
and energy constraints, is not yet known. Recently, a Bayesian for- 
mulation of the decentralized change detection problem with en- 
ergy constraints was considered in 1201 . The problem is solved by 
restricting the solution to a class of algorithms where the sensors 
send scaled and shifted versions of their observations to the fusion 
center. The algorithm in 1201 is energy efficient but leaves out a 
class of algorithms where the sensors can decide not to transmit 
based on the available information. This latter technique can be 
used to save more energy. 

This issue is exploited in to propose a CUSUM based algo- 
rithm called DualCUSUM. This new algorithm was shown to per- 
form much better than the one in [20] and the gap in performance 
was shown to increase at lower energy and false alarm constraints. 

In this paper we provide the false alarm and delay analysis of Du- 
alCUSUM. We also extend DualCUSUM to detect changes when 
the exact information about the pre-change and post-change distri- 
butions is not available (nonparametric setting) and provide a uni- 
fied analysis. 

As will be seen from the paper, the analysis of such distributed 
cooperative systems is very challenging. We use results from Re- 
newal Theory, Extreme Value Theory and theory of Brownian Mo- 
tion to analyse our algorithm. Since CUSUM is, or will be a fun- 
damental element of many distributed algorithms for detection of 
change, the tools and techniques used here can be used to analyse 
those other algorithms as well. 

DualCUSUM has been used for cooperative spectrum sensing in 



Cognitive Radio Systems also in 11151 and shown to provide better 
performance than other algorithms available in literature not only 
in delay in detection but also in saving energy. 

The knowledge of pre-change and post-change distributions may 
not be practical in many cases: random time varying fading in the 
wireless channels (0, (7)) and due to lack of information related 
to transmit power and/or receiver noise ( |17| ). Thus in this paper 
we also extend DualCUSUM to a nonparametric set up. 

We analyse the generalized version of DualCUSUM of which 
parametric and nonparametric versions are special cases. A few 
interesting facts emerge from this analysis: mean detection delay 
is independent of the distribution of the observations but the false 
alarm probability crucially depends on the tail behavior of the dis- 
tributions (at least for the nonparametric case). The lighter the 
tail, the lower the false alarm probability. Therefore for a given 
false alarm constraint, under both parametric and nonparametric 
settings, a system where the observations are lighter tailed, will 
have lower detection delay. 

We also show that the log likelihood function converts a heavy 
tailed distribution to a light tail distribution, or in general, makes 
the tail of the distribution lighter. Since, parametric CUSUM uses 
log likelihood and nonparametric CUSUM does not, the former 
performs better than the latter for a given distribution of observa- 
tions. This interesting property of log likelihood and its implica- 
tions for CUSUM seem to have gone unnoticed so far. 

The paper is organized as follows. We explain the model and 
introduce the algorithm in Section[2] Section[3]analyzes the perfor- 
mance of the algorithm and provides comparison with simulations. 
Section[4]concludes the paper. 

2. MODEL AND ALGORITHM 

Let there be L sensors in a sensor field, sensing observations and 
transmitting to a fusion node. The transmissions from the sensor 
nodes to the fusion node are via a MAC. In our system we assume 
that all the sensor nodes can transmit at the same time. There is 
physical layer fusion at the fusion node (commonly studied Gaus- 
sian MAC is a special case). The fusion node receives data over 
time and decides if there is a change in distribution of the observa- 
tions at the sensors. 

Let X^i be the observation made at sensor / at time k. Sensor 
/ transmits Y k j at time k after processing X k j and its past obser- 
vations. The fusion node receives Y k = Ykl +Zmac.*> where 
{Zmaca} is hi receiver noise. The distribution of the observations 
at each sensor changes simultaneously at a random time T, with a 
known distribution. The {X k j,I > 1} are independent and identi- 
cally distributed (iid) over / and are independent over k, conditioned 
on change time T. Before the change X k j have density /o and after 
the change the density is f\ . The expectation under /, will be de- 
noted by Ej, i = 0, 1, and Poo and P\ denote the probability measure 
under no change and when change happens at time 1, respectively. 
These assumptions are commonly made in the literature (see e.g., 
fTO) and Q9)) 

The objective of the fusion center is to detect this change as soon 
as possible at time T (say) using the messages transmitted from all 
the L sensors, subject to an upper bound a on the probability of 

False Alarm Pp^ = P(t < T) and Sq on the average energy used. 
Often the desired a is quite low, e.g., < 1CT 6 in intrusion detection 
in sensor networks. Then, the general problem is: 



min Eoo =E[(t — T) 



Subj to PfA < a and S avg = E 



k,l 



Our algorithm in (2) does not provide an optimal solution to the 
problem but uses several desirable features to provide better per- 
formance than the algorithms we are aware of. We reproduce the 
algorithm, DualCUSUM, for an easy reference: 

1. Sensor / uses CUSUM, 

W U = max (0, Wk-it + log [/1 {X kJ ) // {X u ) ] ) , (2) 

where, W j = 0, 1 < / < L. 

2. Sensor I transmits Y k j = bl{w u >y}- Here 1^ denotes the in- 
dicator function of set A and b is a design parameter. 

3. Physical layer fusion (as in j20|) is used to reduce transmis- 
sion delay, i.e., Y k = £f =1 Y kJ + Z MACtk , where Z M AC,k is the 
receiver noise. 

4. Finally, Fusion center runs CUSUM: 



F k = mwtl 0,F k _i +log 



giiXk) 
go(Y k ) 



F = 0, 



(3) 



where go is the density of ZmacA' m e MAC noise at the fu- 
sion node, and gj is the density of Z MAC k + bl, 1 being a 
design parameter. 

5. The fusion center declares a change at time t(/3, y,b,I) when 
F k crosses a threshold jS: r(f},y,b,I) = infffc : F k > j8}. 

Multiple values of (jS, y,b,I) will satisfy both the false alarm and 

the energy constraint. One can minimize Eod = E[(z — T) + ] over 
this parameter set. In Section[3]we obtain the performance of Du- 
alCUSUM for given values (j6,y,Z>,/) which then can be used in 
the optimization algorithm developed in (2) to efficiently solve the 
optimization problem above. 

DualCUSUM, as the original CUSUM itself, has a strong limi- 
tation. It requires exact knowledge of /o and f\ . This information 
will be available apriori to varying degrees in a practical scenario. 
Depending upon the type of uncertainty in /o,/i, different algo- 
rithms/variations on CUSUM are available (|4), (6)). One common 
algorithm, called nonparametric CUSUM is to replace 10 by 



W k , 



max{0,W k i+X k i-D}, 



(4) 



where, D is an appropriate constant such that E[X k j —D] is nega- 
tive before change and positive after change. If the mean of X k j 
is known before and after the change, D can be chosen as the av- 
erage of the two means. Depending on the uncertainties involved, 
we should be able to obtain such a D. More generally, we should 
be able to estimate D. For Gaussian and exponential distributions, 
nonparametric CUSUM becomes CUSUM for some appropriate D 
and scaling. If at the fusion node go is not known (in our CUSUM 
algorithm ([3} at the fusion node, g/(x) = /fc + go(x)), then one can 
use J?) even at the fusion node. 

In the following we will compute Pp A and Eqq for a generalized 
class of algorithms where at the sensor nodes and at the fusion node 
we use the algorithm, 



Wit+i =max{0,lV jt +Z^ +1 }, 



(5) 



,1<1<L. (1) 



where, {Z k } is an iid sequence with different distributions before 
and after the change. We will assume that E[Z k ] < before the 
change and E[Z k ] > after the change. We will denote by fz,Fz 
and Pz the density, cdf and probability measure for Z k . 

Algorithm 10 contains CUSUM and nonparametric CUSUM as 
special cases. In the next section we analyze the generalized Du- 
alCUSUM with {5}. We emphasize that unlike DualCUSUM, this 
algorithm does not require knowledge of /o and f\ ; we only need 
to chose D appropriately. But, the performance of this algorithm, 



as we show in the next section, does depend on the underlying dis- 
tributions. This is typical of such algorithms. 

3. ANALYSIS 

In this section, we first compute the false alarm probability Pp^ 
and then the delay Ep,D- The idea is to model the times at which 
the CUSUM {W k } at the local sensors, crosses the threshold 7 (we 
drop subscript / for convenience) and the local nodes transmit to 
the fusion node (Fig. [TJ. 



Sojourn time above 7 




Figure 1: Excursions of W k above yean be approximated by a compound Pois- 
son process. A local node transmits to the fusion node during these excursions. 

Computing Pp^ requires finding (when Zj. has distribution /g) the 
distribution of Ty, the first time W k crosses 7, the amount of time it 
stays above 7 (sojourn time above 7), and the probability that the 
fusion node declares a change during a sojourn time. These are 
computed in Sections [3. 113.51 Computation of Ep>D is sketched in 
Section [3~7l 

We will need the following notations and definitions. Let X be a 
random variable with distribution F . Then F*" denotes the 77-fold 
convolution of F and F(x) = 1 — F(x). 

DEFINITION 1. (fiTj) F isheavy tailed if for any e > 0, £[e £ l z l] = 
00. F is subexponential if F* 2 (x) /F(x) — > 2 as x — > °°. If F is not 
heavy tailed, we call it light tailed. 

Gaussian, Exponential and Laplace distributions are light tailed. 
Pareto, Lognormal and Weibull distributions are subexponential. 
Subexponential distributions are a subclass of heavy tailed distri- 
butions. 

Often it is said that light tailed distributions may provide better 
system behavior than the heavy tailed ((3j). We demonstrate this 
for the probability of false alarm. In particular we will show that 
if Fz is light tailed before change then Pp^ is much less than if it 
is heavy tailed. Interestingly, we will also show that Eqq is largely 
insensitive to the tail behavior of Fz- 

CUSUM has the interesting property that it transforms a large 
class of heavy tailed distributions into light tailed distributions. 
Consider the class of Pareto distributions: 



K- 



K 

(K+\) 



for x > x m , where x m > 0. 



1) andfor/i, /ST: 



a where a is a posi- 

-za 



For /o, we take K = (ac- 
tive constant. Then, if Z = log(f l (X)/f Q (X)), P x [Z k > z] ~ e 
On the other hand, light tailed distributions stay light tailed. This 
important property of log likelihood seems to have escaped the at- 
tention of investigators before. This makes CUSUM perform bet- 
ter than the nonparametric CUSUM because as mentioned above, 
for CUSUM with Pareto distribution, the Pp^ will be as for a light 
tailed distribution. But for nonparametric CUSUM, it will be much 
larger, while the Ep,o in the two cases will be comparable. 



3.1 Behavior of Wk under 

The process {W k } is a reflected random walk with negative drift 
under P^. Figure (0 shows a typical sample path for {Vl^}- The 
process visits (regenerates) a finite number of times before it 
crosses the threshold 7 at, 



iaf{k> l:W k > 7}. 



(6) 



We call Ty the First Passage Time (FPT). The overshoot T = Wr - 
7. Let 



To = inf{k : k > T r ; W k < 0} - Ty 
77 = #{k:W k >r,T r <k<T r + To}. 



and 



(7) 



During time 77 (called a batch) a local node transmits to the fu- 
sion node. Thus, these are the times during which the fusion node 
will most likely declare a change. The overshoot F can have signif- 
icant impact on 77 . 

It has been shown in 1131 that the point process of exceedances of 
7 by W k , converges to a compound Poisson process as 7 — » °°. The 
points appear as clusters. The intervals between the clusters have 
the same distribution as that of Ty in ® and the distribution of TJ 
in (Q gives the distribution of the size of the cluster, i.e., the batch 
of the compound Poisson process. Since, one has to choose large 
values of 7 to keep Pp^ small, a batch Poisson process provides a 
good approximation in our scenario. 

In the next few sections we give results on the distribution of Ty, 
overshoot T, and the distribution of the batch 7] which will be used 
in computing Pp&. 

3.2 First Passage Time under P OT 

From the compound Poisson process approximation mentioned 
above, 



lim Poo{Ty > x} = exp( 



-XyX) 



x > 0, 



(8) 



where, Ay a positive constant. 

In J2| a formula for Ay was used which is computable for Gaus- 
sian distribution only. However, by solving integral equations ob- 
tained via renewal arguments ( 1141 ). one can obtain the mean of 
FPT for any distribution. Epochs when W k = are renewal epochs 
for this process. Let L(s) be the mean FPT with Wq = s > 0. Hence 
Ay = 1 /L(0). Then from renewal arguments: 

U?) = Fz{s)L(p) + J\(s + z)dF z (z)dz + P[Z > y-s}. 



(9) 



The equation is obtained by conditioning on Zo = z. If Zo < —s, 
then W\ = 0, providing the first term on the right. If Z > 7 — s, 
then the threshold is approached in one step only, providing the last 
term. This equation can be solved recursively on L(s),0 < s < 7. 
An algorithm provided in (9j can be used to compute ((9) efficiently. 
Table Q] provides E[Zy] for Pareto distribution with K = 2.1 and 
Gaussian distribution with EZ k = —0.5 and var(Zf.) = 1. We see 
that as 7 increases E[ty] for Gaussian distribution becomes much 
larger than for the Pareto distribution. This implies that Pp& for the 
Gaussian distribution should be much less than for Pareto, K = 2. 1 
if 7 is large. 
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E\t r \ Pareto 
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930 


800 


6 


2551 


1100 
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6950 


1455 


8 


19020 


1880 



Table 1: Mean FPT Etey] for Pareto (X = 2. 1) and Gaussian with EZ k = -0.5. 



3.3 Distribution of overshoot 

Next we consider mean and distribution of the overshoot T. From 
renewal equations as in ((9}, we can exactly compute ET for any dis- 
tribution. If R(x) = ET with W = x, 

R(x) = E[Z k -(r-x)\Z k >(r-x)]Pz(Z k >Y-x) 

+ [ 7 R(y)fz(y-x)dy + R(0)F z (-x). (10) 

Jy=0 

Mean overshoot ET = R(0). For light tails ET converges quickly 
to a constant value as y — > °°. 

The distribution of the overshoot for heavy tails is reported in 
171 . One can show that for light tailed distributions, overshoot is 
exponentially distributed. One can also obtain exact distribution 
using renewal integral equations as in l[9j and Goj. We plot the 
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Figure 2: Complementary CDF of T for Pareto K = 2.1, EZ t = -0.3 and 
var(Zi) = 1 and y = 8 




Figure 3: Complementary CDF of r for Z f ~JV( -0.3,1) and y> 6 



distribution of overshoot for Pareto distribution with K = 2.1 in 
Figure 10 and for Gaussian distribution in Figure 0, The mean 
overshoot £[r] was obtained using equation d 1 0b . The distribution 
of r for Pareto distribution was obtained via the result in |T) and 
for the Gaussian distribution by exponential distributions. 

Comparing Figures and 0, we see that the analytical approx- 
imations are very good. Also, the overshoot for Pareto distribution 
is much more than for the Gaussian distribution. 

3.4 Distribution of the Batch 

3.4.1 Distribution of batch for heavy tail 

Theorem 2.4 of fTj provides the batch size distribution for subex- 
ponential Z. Figure shows the plot of Batch complementary 
CDF for Pareto distribution with parameters K = 2.1. One sees a 
good match with simulations. 



3.4.2 Distribution of batch for light tail 

Let Gj(x) be the conditional batch distribution, Gj(x) = P(rj < 
iWx r = J + X )y when the overshoot is x. Gj(x) was obtained in (2) 
by the Brownian Motion (BM) approximation of {W^}. We use this 
approximation along with exponential distribution for T to obtain 

for light tailed distributions. Figure plots the distribution of r\ 
for Zy_ with Laplace distribution via Jilt and via simulations. 

For Lognormal distribution, which can be approximated via both 
heavy tailed and light tailed approximations provided above, Jl lb 
provides a better approximation. 

Comparing Figures <J4]» and 10 one sees that the batch size for 
a Pareto distribution is larger than for a Laplace distribution even 
when they have same mean and variance. This is a direct conse- 
quence of having larger overshoots. 




40 80 120 160 



Figure 4: Complementary CDF of Batch ij for Pareto K = 2.1, EZ k = -0.3 and 
var(Z t ) = 1 and y= 15. 
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Figure 5: Complementary CDF of Batch I) for Laplace Z k with EZ k = -0.3 
and var(Zk) — 1 and y> 7. 

3.5 False Alarm Analysis 

The false alarm in DualCUSUM can happen in two ways: one 
within a batch (we denote its probability by p) and another outside 
it, i.e., due to {Zmac,*}- We will compute these later on. Now, we 
compute the Pp& from these quantities. 

From the assumptions made and the above approximation, the 
inter-arrival time of the batches in the system (at the fusion center) 
is exponentially distributed with rate LXy (because the processes 
{Wjt/} are independent for different nodes each generating batches 
as Poisson processes with rate X). Then, the number of batches 
appearing before the time of change is a Poisson random variable 
with parameter LXyi, when T = i. In the following, we will show 



that the time to FA outside a batch is exponentially distributed with 
parameter Xq (to be defined below). Therefore, if T ~ Geom(p), 
then one can show that: 



Pfa = 1 - 



l_ e -(VrV#)(l-p)' 



(12) 



Similarly, one can obtain expression for Pp^ when T is not geomet- 
ric. 

False Alarm within a Batch: 

We have seen above that for light tailed Zj, the E[Zy] is large and 
the batch sizes are small. Thus, the batches by different local nodes 
do not overlap. However, it is not true for heavy tailed distributions. 
Thus we compute p for the two cases separately. 

Light Tailed 

The false alarm probability (p) within a batch, can be computed 
as, p » ££Lj P(t] = ()P(FA 77 = i), where P(FA \l) = i) represents 
the probability of FA (CUSUM at the fusion center crossing jS) in 
i transmissions when one local node is already transmitting, i.e., 
Yk = b + Zmac,1c- If T /3 is the FPT variable at the fusion center, 
then, P(FA \ rj = i) = P(Tp < i). Since 7] is small for negative drift 
under fy (D in © is chosen that way) we use integral equations 
to compute the distribution of Xp for observations given in this 
paragraph. 

Tablef2]gives the comparison of the Pp A values obtained via j 1 2b 
and simulations for light tailed distributions Gaussian and Laplace. 
It turns out that the expression is also valid for heavy tails like Log- 
normal (also shown in Tablef2]l. One can see a good match. 

Heavy Tailed 

Now, we use different arguments to compute p and then use it in 
J 1 2b . For simplicity, in the following, the fusion center is assumed 
to use (O for detection and not nonparametric CUSUM. From (2), 
the optimal choice of / is found to be always greater than 1. 

Let m be the minimum number of sensors required to make drift 
of positive. We denote the drift, with m nodes transmitting, by 
fl,„. Then we approximate p by the probability that Fj will have 
positive drift during a batch and that the batch lasts for /3 / fl m time 
(the time needed for F* to cross /3 when the drift is /!„,) after m 
sensors start transmitting. We use this approximation to compute 
Pfa for Pareto K = 2. 1 distribution. This is also provided in Table 
[2] We see that the approximation is indeed good for Pareto K = 2.1. 
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16 


16 


0.66 


0.55 


Log- 
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25 
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1.76 


normal 


10 


2 


25 


20 


2.97 


3.5 


Pareto 


5 


3 


30 


30 


1.93 


1.77 


K=2.1 


5 


3 


50 


50 


0.23 


0.25 



Table 2: Pfa for various distributions using {4} at the local node and (3) at the 
fusion node: EZ k = -0.3, \>ar(Z t ) = 1, p = 0.005 and b = l. 

False Alarm outside a Batch 

In the absence of any transmission from the sensors, F* ~ ^(0, 0^ AC ) 
if Zmac ~ ^(0) °mac)' wnere N(0, c^c) denotes Gaussian distri- 
bution with mean and variance 0^j AC . Hence, F^ has negative 



drift. Thus the time to first reach /3, i.e., time till FA, is approx- 
imately exponentially distributed with parameter A<) which can be 
obtained from SectionfOl 



3.6 Comparative overall performance 

The effect of tail of on FPT, overshoot and batch size was 
shown in the previous sections. This causes much larger PpA for 
heavy tailed Z# compared to the light tailed distributions for same 
mean and variance. This gets reflected into large Edd for heavy 
tailed distributions for a given Pp A . Table [3] confirms these con- 
clusions as the Er>r> f° r a light tailed system is much smaller as 
compared to the one from a heavy tailed system. The individual 
systems are optimized to make sure that each performs at its best. 

Table [4] shows the comparative performance of paramentric and 
nonparametric DualCUSUM's for given fy and f\. The difference 
in performance is most pronounced when the tail of /o is heaviest, 
i.e., for K = 7, while the performance is same for Gaussian distri- 
butions on which log likelihood function has no effect. 

Note that in Table[4] the variance of Zj is different for parametric 
and nonparametric CUSUMs. The overall effect is thus a combina- 
tion of the effect of tails and that of the variances. However, as can 
be seen from the table, the effect of tail dominates and the general 
conclusion that light tailed systems are better, still holds. 

Edd m Tables [3] and |4] are computed via simulations. However 
in the next section we theoretically evaluate Edd and then compare 
with the simulated values. 
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Pfa 


E DD 

Gauss 


E DD 

Pareto 
K=2.1 


5e-4 


e-2 


29 


36 


5e-4 


e-3 


33 


49 


le-4 


e-3 


41 


95 



Table 3: Comparison of E* DD of Gaussian (/" — 3) and Pareto (/* — 4)withL — 5, 

S = 5, EZ t = -0.5, var(Z k ) = 1 and Z M ac = N(0,\). 

3.7 Computation of Edd 

The detection delay, Edd, at the fusion node, after the change 
has occurred, can be written as, 



Edd '- 



E[x-T\x>T){\-P FA ) 



(13) 



The Edd is atleast equal to the time it takes for some of the sen- 
sors to transmit and make the drift of F^ positive. After that the 
additional delay is due to the time it takes for F^ to cross /3 with 
this positive drift, or due to the additional positive drift given by 
transmissions due to the other remaining sensors. This concept is 
used to compute the Edd in the following. 

When jl = EZi< > 0, the time Ty for at a local node to cross 
threshold y satisfies, £[Ty]/y— > l/jl as y ^ <*>. Thus for large y, 
Ty ~ y//x. Again for large y, one can invoke Central Limit The- 
orem and approximate Ty by a Gaussian random variable: ty ~ 

N{$,$), where, a 1 = var(Z k ). 

Let fit be the drift of fusion CUSUM F^ when / local nodes are 
transmitting. 

Let tj be the point at which the drift of F/, changes from i to 
i.e., when the drift of Fj is already and one additional node 
(out of the remaining L — i + 1) starts transmitting. Call t, the ith 
transition epoch. Let T x \ , X x 2, ■ ■ ■ , T A -x be iid with distribution of 
T x . Then t\ , the time at which the first transmission happens, can 
be written as, t\ = M(L, y) = min(Ty.i , Ty2, • • • , fyz)- Let m(L, y) = 
E[M(L, y)] = E[ti]. Then E{t 2 }~m(L,y)+m(L- 1, y-m(L, y)/i). 
The term m(L— 1 , y — m(L, y)/i) represents the mean time for one 



of the remaining L — 1 sensors to cross 7 given that their respective 
CUSUMs have run for t\ slots, i.e., if %_\ = y - m(L,y)fl, then 
m{L-l,y-m(L,Y)n) = E[]ma(T:y L _ u i,Zy t _ h 2,...,^ rL _ u L-i)]- In 
t\ slots each of the W k has moved up by approximately E[t]}fl and 
hence the correction. This way we can compute the approximation 
to mean of each of the transition epochs: 

i 

E[U] = £m(Z,»), 

l=L 

where, % = 7 and 7/ = 7/ + j — fl.m(l + 1, 7/+i) and i < L. It may 
happen for large values of fl that, m(k, y k ) < 1; for some k. In that 
case we set m(i, 7) = 0,k > i > 2. 
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E DD 
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54.8 


4.4 


Pareto x m — 1 
A" — 40 to A" — 30 


5/4 


69.1 


24.9 


Gaussian o" — 1 
EZt = to EZ t = 0.6 


5/3 


10.1 


10.1 



Table 4: Comparative performance of parametric and nonparametric Dual- 

CUSUM for P FA = 0.01 with p = 0.05, 4 = 7.61, and Z MAC = N(0, 1). 

Now define, F = £[F f/ _i], the mean value of F k just before the 
transition epoch Then a recursion can be written to approxi- 
mately compute the Ff. 

Fk=F k _i + \ {Wm >0} [ftt_ im(L-k+l, y L _ k+ l)] ; Ft = 0. 

Let, j = min{/' : /I; > and < F[f, + 1 ]}, then £yj£) can be ap- 
proximated by, 

SflD««[ff]+^, (14) 

where, m(&,jc) can be computed as, m(k,x) = YZ=0^(^ x > ')*• 

Since, the strong law of large number and the central limit theo- 
rem suffice to build the approximations, the Edd is independent of 
the distribution of Z k but depends only on its mean and variance. 
The results are shown in Table [5] for different distributions. The 
second column is our approximation d!4t and the rest are obtained 
via actual system simulations. It can be seen that, as y reaches 50, 
the Edd of all the distributions considered is nearly 147. 
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K=3 
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5.3 


9.1 
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9.3 


10.7 
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16.6 


16.8 


16.9 


18.7 


15 


30.3 


36.3 


36.5 


36.7 


38.5 


50 


146.7 


146.8 


147.1 


147.6 


150.5 



Table 5: Comparison of Edd for various distributions: L — 10, / — I,j8 — 7 
EZ k - -0.3, Var(Z fc )=l and b = L 



4. CONCLUSIONS 

We have proposed an energy efficient distributed change detec- 
tion scheme in (2) which uses the physical layer fusion technique 
and CUSUM at the sensors as well as at the fusion center. In this 
paper we extend the algorithm to also include the nonparametric 
CUSUM. We have theoretically computed the probability of false 
alarm and mean delay in change detection. The analytical results 
provide a good approximation which can then be used to choose 
the optimal parameters using the optimization technique in |]2j. 
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